In recent years, deep learning-based parallel imaging (PI) has made great progress in accelerating magnetic resonance imaging (MRI). Nevertheless, the performance and robustness of existing methods can still be unsatisfactory. In this work, we propose to explore k-space domain learning via a weighted k-space generative model (WKGM) for flexible PI reconstruction. Specifically, WKGM is a generalized k-space domain model in which a k-space weighting technique and a high-dimensional space augmentation design are efficiently incorporated into score-based generative model training, yielding good and robust reconstruction. Moreover, WKGM is flexible and can therefore be synergistically combined with various traditional k-space PI models, producing learning-based priors for high-fidelity reconstruction. Experiments on datasets with different sampling patterns and acceleration factors demonstrate that WKGM can attain state-of-the-art reconstruction results with the well-learned k-space generative prior.
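The abstract above mentions a k-space weighting technique that prepares data for score-based generative model training. As a minimal sketch of the general idea (the specific weighting function and parameters here are hypothetical, not the paper's): low-frequency k-space magnitudes dominate by orders of magnitude, so a radial weight can damp the center and flatten the dynamic range before training.

```python
import numpy as np

def radial_weight(shape, sigma=0.25):
    """Hypothetical radial k-space weighting: suppresses the dominant
    low-frequency center so magnitudes are more uniform for training."""
    ky = np.fft.fftfreq(shape[0])
    kx = np.fft.fftfreq(shape[1])
    r2 = ky[:, None] ** 2 + kx[None, :] ** 2
    return 1.0 - np.exp(-r2 / (2 * sigma ** 2))

def weight_kspace(image):
    """Transform an image to k-space and apply the weighting."""
    kspace = np.fft.fft2(image)
    w = radial_weight(image.shape)
    return w * kspace, w

img = np.random.rand(64, 64)
wk, w = weight_kspace(img)
# the DC term (k = 0) is fully suppressed; high frequencies pass through
```

Since the weight is known and nonzero away from the center, it can be divided out (or handled by the data-consistency step of a PI model) at reconstruction time.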
Defect detection plays a vital role in the manufacturing process of integrated circuits (ICs). Die attachment and wire bonding are two steps of the manufacturing process that determine the power and signal transmission quality and reliability of an IC. This paper presents a survey, or literature review, of methods for detecting these defects based on different sensing modalities, including optical, radiological, acoustic, and infrared thermography. The survey discusses the detection methods used, covering both conventional learning and deep learning approaches for detecting die attachment and wire bonding defects, as well as challenges and future research directions.
Die analysis is an essential numismatic method and an important tool for ancient economic history. However, manual die studies are too labor-intensive to comprehensively cover large coinages such as those of the Roman Empire. We address this problem by proposing a model for unsupervised computational die analysis, which can reduce the time investment required for large-scale die studies by several orders of magnitude, in many cases from years to weeks. From a computer vision viewpoint, die studies pose a challenging unsupervised clustering problem, as they involve an unknown and large number of highly similar semantic classes of imbalanced sizes. We address these issues by determining dissimilarities between coin faces derived from specifically designed Gaussian process-based keypoint features within a Bayesian distance clustering framework. The efficacy of our method is demonstrated through an analysis of 1,135 Roman silver coins struck in 64-66 C.E.
This paper provides a unified view to explain different adversarial attack and defense methods, \emph{i.e.}, the view of multi-order interactions between input variables of DNNs. Based on multi-order interactions, we find that adversarial attacks mainly affect high-order interactions to fool DNNs. Furthermore, we find that the robustness of adversarially trained DNNs comes from category-specific low-order interactions. Our findings provide a potential method to unify adversarial perturbations and robustness, which can explain existing defense methods in a principled way. In addition, our findings also revise previous, inaccurate understandings of the bias of adversarially learned features.
The standard semantics of multi-agent epistemic logic S5 is based on Kripke models whose accessibility relations are reflexive, symmetric and transitive. This one dimensional structure contains implicit higher-dimensional information beyond pairwise interactions, that we formalized as pure simplicial models in a previous work (Information and Computation, 2021). Here we extend the theory to encompass simplicial models that are not necessarily pure. The corresponding class of Kripke models are those where the accessibility relation is symmetric and transitive, but might not be reflexive. Such models correspond to the epistemic logic KB4 . Impure simplicial models arise in situations where two possible worlds may not have the same set of agents. We illustrate it with distributed computing examples of synchronous systems where processes may crash.
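The abstract above states that impure simplicial models correspond to Kripke frames whose accessibility relation is symmetric and transitive but not necessarily reflexive (epistemic logic KB4), and motivates this with processes that may crash. A tiny illustrative check of such a frame (the encoding of worlds and the relation here is my own, not from the paper):

```python
# World 0 stands for a "crashed" process state: it has no self-loop,
# so the relation is not reflexive, yet remains symmetric and transitive,
# matching the KB4 frame conditions described above.
worlds = {0, 1, 2}
R = {(1, 2), (2, 1), (1, 1), (2, 2)}

symmetric  = all((b, a) in R for (a, b) in R)
transitive = all((a, c) in R
                 for (a, b) in R for (b2, c) in R if b == b2)
reflexive  = all((w, w) in R for w in worlds)
# symmetric and transitive hold; reflexive fails at world 0
```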
In this paper, we propose a robust 3D detector, named Cross Modal Transformer (CMT), for end-to-end 3D multi-modal detection. Without explicit view transformation, CMT takes the image and point clouds tokens as inputs and directly outputs accurate 3D bounding boxes. The spatial alignment of multi-modal tokens is performed implicitly, by encoding the 3D points into multi-modal features. The core design of CMT is quite simple while its performance is impressive. CMT obtains 73.0% NDS on nuScenes benchmark. Moreover, CMT has a strong robustness even if the LiDAR is missing. Code will be released at https://github.com/junjie18/CMT.
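CMT's input interface, as described above, takes image and point-cloud tokens jointly without an explicit view transformation. A toy sketch of that interface (all dimensions here are hypothetical): flatten both feature maps into token sequences and concatenate them; a transformer decoder would then attend over the joint sequence, with 3D geometry injected through positional encodings.

```python
import numpy as np

# Hypothetical feature shapes: 6 camera views and one BEV LiDAR map,
# both with 256 channels.
img_feat = np.random.rand(6, 256, 16, 44)   # (views, C, H, W)
pts_feat = np.random.rand(256, 180, 180)    # (C, H, W) bird's-eye view

# Flatten spatial positions into tokens of dimension C = 256.
img_tokens = img_feat.transpose(0, 2, 3, 1).reshape(-1, 256)
pts_tokens = pts_feat.transpose(1, 2, 0).reshape(-1, 256)

# One joint multi-modal token sequence: no projection of images into
# BEV (or vice versa) is performed before fusion.
tokens = np.concatenate([img_tokens, pts_tokens], axis=0)
```

This also hints at why the model can tolerate a missing LiDAR: dropping `pts_tokens` shortens the sequence but leaves the interface intact.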
Dataset distillation has emerged as a prominent technique to improve data efficiency when training machine learning models. It encapsulates the knowledge from a large dataset into a smaller synthetic dataset. A model trained on this smaller distilled dataset can attain comparable performance to a model trained on the original training dataset. However, the existing dataset distillation techniques mainly aim at achieving the best trade-off between resource usage efficiency and model utility. The security risks stemming from them have not been explored. This study performs the first backdoor attack against the models trained on the data distilled by dataset distillation models in the image domain. Concretely, we inject triggers into the synthetic data during the distillation procedure rather than during the model training stage, where all previous attacks are performed. We propose two types of backdoor attacks, namely NAIVEATTACK and DOORPING. NAIVEATTACK simply adds triggers to the raw data at the initial distillation phase, while DOORPING iteratively updates the triggers during the entire distillation procedure. We conduct extensive evaluations on multiple datasets, architectures, and dataset distillation techniques. Empirical evaluation shows that NAIVEATTACK achieves decent attack success rate (ASR) scores in some cases, while DOORPING reaches higher ASR scores (close to 1.0) in all cases. Furthermore, we conduct a comprehensive ablation study to analyze the factors that may affect the attack performance. Finally, we evaluate multiple defense mechanisms against our backdoor attacks and show that our attacks can practically circumvent these defense mechanisms.
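The simpler of the two attacks above, NAIVEATTACK, stamps triggers into the raw data before distillation begins. A minimal sketch of that poisoning step (trigger shape, poison rate, and array layout are illustrative assumptions, not the paper's exact settings):

```python
import numpy as np

def add_trigger(images, labels, target_label, size=3, rate=0.1, seed=0):
    """NAIVEATTACK-style poisoning sketch: stamp a white square into the
    corner of a fraction of the raw images and relabel them to the target
    class, *before* any distillation is run."""
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    idx = rng.choice(len(images), int(rate * len(images)), replace=False)
    images[idx, -size:, -size:] = 1.0   # trigger patch, bottom-right corner
    labels[idx] = target_label
    return images, labels, idx

imgs = np.zeros((100, 28, 28), dtype=np.float32)
lbls = np.zeros(100, dtype=np.int64)
p_imgs, p_lbls, idx = add_trigger(imgs, lbls, target_label=7)
```

DOORPING differs in that the trigger is re-optimized at every distillation iteration rather than fixed up front, which is why it reaches higher ASR.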
Few Shot Instance Segmentation (FSIS) requires models to detect and segment novel classes with only a few support examples. In this work, we explore a simple yet unified solution for FSIS as well as its incremental variants, and introduce a new framework named Reference Twice (RefT) to fully explore the relationship between support and query features based on a Transformer-like framework. Our key insights are twofold: First, with the aid of support masks, we can generate dynamic class centers more appropriately to re-weight query features. Second, we find that support object queries have already encoded key factors after base training. In this way, the query features can be enhanced twice from two aspects, i.e., feature-level and instance-level. In particular, we first design a mask-based dynamic weighting module to enhance support features and then propose to link object queries for better calibration via cross-attention. After the above steps, performance on the novel classes improves significantly over our strong baseline. Additionally, our new framework can be easily extended to incremental FSIS with minor modifications. When benchmarking results on the COCO dataset for the FSIS, gFSIS, and iFSIS settings, our method achieves competitive performance compared to existing approaches across different shots, e.g., we boost nAP by a noticeable +8.2/+9.4 over the current state-of-the-art FSIS method for 10/30-shot. We further demonstrate the superiority of our approach on Few Shot Object Detection. Code and model will be available.
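The first of RefT's two enhancements, mask-based dynamic class centers that re-weight query features, can be sketched as masked average pooling followed by a channel gate. This is a simplified stand-in for the module described above; the sigmoid gating and all shapes are my own illustrative choices.

```python
import numpy as np

def masked_class_center(support_feat, support_mask):
    """Masked average pooling: collapse a support feature map (C, H, W)
    into one class-center vector using the binary support mask (H, W)."""
    m = support_mask.astype(np.float32)
    return (support_feat * m).sum(axis=(1, 2)) / max(m.sum(), 1e-6)

def reweight_query(query_feat, center):
    """Channel-wise re-weighting of query features (C, H, W) by a
    sigmoid-squashed class center: channels that fire strongly on the
    support object are emphasized in the query."""
    gate = 1.0 / (1.0 + np.exp(-center))      # per-channel gate in (0, 1)
    return query_feat * gate[:, None, None]

C, H, W = 8, 16, 16
s = np.random.rand(C, H, W).astype(np.float32)
mask = np.zeros((H, W)); mask[4:12, 4:12] = 1   # support object region
q = np.random.rand(C, H, W).astype(np.float32)
out = reweight_query(q, masked_class_center(s, mask))
```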
This paper focuses on designing efficient models with low parameter counts and FLOPs for dense predictions. Even though CNN-based lightweight methods have achieved stunning results after years of research, the trade-off between model accuracy and constrained resources still needs further improvement. This work rethinks the essential unity of the efficient Inverted Residual Block in MobileNetv2 and the effective Transformer in ViT, inductively abstracting a general concept of the Meta-Mobile Block, and we argue that the specific instantiation is very important to model performance even though the instantiations share the same framework. Motivated by this phenomenon, we deduce a simple yet efficient modern \textbf{I}nverted \textbf{R}esidual \textbf{M}obile \textbf{B}lock (iRMB) for mobile applications, which absorbs CNN-like efficiency to model short-distance dependency and Transformer-like dynamic modeling capability to learn long-distance interactions. Furthermore, we design a ResNet-like 4-phase \textbf{E}fficient \textbf{MO}del (EMO) based only on a series of iRMBs for dense applications. Extensive experiments on the ImageNet-1K, COCO2017, and ADE20K benchmarks demonstrate the superiority of our EMO over state-of-the-art methods, \eg, our EMO-1M/2M/5M achieve 71.5, 75.1, and 78.4 Top-1 accuracy, surpassing \textbf{SoTA} CNN-/Transformer-based models while trading off model accuracy and efficiency well.
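The structural idea behind the iRMB described above, an inverted channel expansion wrapped around token mixing, can be caricatured in a few lines. This sketch is heavily simplified (single head, random weights, and it omits the depthwise convolution that supplies the CNN-like short-distance modeling), so it only illustrates the expand, attend, project, residual pattern, not the actual block.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def irmb_sketch(x, W_up, W_down):
    """Very simplified iRMB-style block: expand channels (inverted
    residual), mix tokens with self-attention for long-distance
    interactions, project back down, and add the residual."""
    h = x @ W_up                                    # (N, C*r) expansion
    attn = softmax(h @ h.T / np.sqrt(h.shape[1]))   # (N, N) token mixing
    h = attn @ h                                    # long-range interactions
    return x + h @ W_down                           # projection + residual

N, C, r = 64, 16, 4
x = np.random.rand(N, C)
out = irmb_sketch(x,
                  np.random.rand(C, C * r) * 0.1,
                  np.random.rand(C * r, C) * 0.1)
```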